227 research outputs found

    EntrezAJAX: direct web browser access to the Entrez Programming Utilities

    Get PDF
    Web applications for biology and medicine often need to integrate data from Entrez services provided by the National Center for Biotechnology Information. However, direct access to Entrez from a web browser is not possible due to 'same-origin' security restrictions. The use of "Asynchronous JavaScript and XML" (AJAX) to create rich, interactive web applications is now commonplace. The ability to access Entrez via AJAX would be advantageous in the creation of integrated biomedical web resources. We describe EntrezAJAX, which provides access to Entrez eUtils and is able to circumvent same-origin browser restrictions. EntrezAJAX is easily implemented by JavaScript developers and provides identical functionality as Entrez eUtils as well as enhanced functionality to ease development. We provide easy-to-understand developer examples written in JavaScript to illustrate potential uses of this service. For the purposes of speed, reliability and scalability, EntrezAJAX has been deployed on Google App Engine, a freely available cloud service. The EntrezAJAX webpage is located at http://entrezajax.appspot.com

    Defining bacterial species in the genomic era : insights from the genus Acinetobacter

    Get PDF
    Background: Microbial taxonomy remains a conservative discipline, relying on phenotypic information derived from growth in pure culture and techniques that are time-consuming and difficult to standardize, particularly when compared to the ease of modern high-throughput genome sequencing. Here, drawing on the genus Acinetobacter as a test case, we examine whether bacterial taxonomy could abandon phenotypic approaches and DNA-DNA hybridization and, instead, rely exclusively on analyses of genome sequence data. Results: In pursuit of this goal, we generated a set of thirteen new draft genome sequences, representing ten species, combined them with other publically available genome sequences and analyzed these 38 strains belonging to the genus. We found that analyses based on 16S rRNA gene sequences were not capable of delineating accepted species. However, a core genome phylogenetic tree proved consistent with the currently accepted taxonomy of the genus, while also identifying three misclassifications of strains in collections or databases. Among rapid distance-based methods, we found average-nucleotide identity (ANI) analyses delivered results consistent with traditional and phylogenetic classifications, whereas gene content based approaches appear to be too strongly influenced by the effects of horizontal gene transfer to agree with previously accepted species. Conclusion: We believe a combination of core genome phylogenetic analysis and ANI provides an appropriate method for bacterial species delineation, whereby bacterial species are defined as monophyletic groups of isolates with genomes that exhibit at least 95% pair-wise ANI. The proposed method is backwards compatible; it provides a scalable and uniform approach that works for both culturable and non-culturable species; is faster and cheaper than traditional taxonomic methods; is easily replicable and transferable among research institutions; and lastly, falls in line with Darwin’s vision of classification becoming, as far as is possible, genealogical

    High-throughput sequencing of 16S rRNA gene amplicons : effects of extraction procedure, primer length and annealing temperature

    Get PDF
    The analysis of 16S-rDNA sequences to assess the bacterial community composition of a sample is a widely used technique that has increased with the advent of high throughput sequencing. Although considerable effort has been devoted to identifying the most informative region of the 16S gene and the optimal informatics procedures to process the data, little attention has been paid to the PCR step, in particular annealing temperature and primer length. To address this, amplicons derived from 16S-rDNA were generated from chicken caecal content DNA using different annealing temperatures, primers and different DNA extraction procedures. The amplicons were pyrosequenced to determine the optimal protocols for capture of maximum bacterial diversity from a chicken caecal sample. Even at very low annealing temperatures there was little effect on the community structure, although the abundance of some OTUs such as Bifidobacterium increased. Using shorter primers did not reveal any novel OTUs but did change the community profile obtained. Mechanical disruption of the sample by bead beating had a significant effect on the results obtained, as did repeated freezing and thawing. In conclusion, existing primers and standard annealing temperatures captured as much diversity as lower annealing temperatures and shorter primers

    Calculating Orthologs in Bacteria and Archaea: A Divide and Conquer Approach

    Get PDF
    Among proteins, orthologs are defined as those that are derived by vertical descent from a single progenitor in the last common ancestor of their host organisms. Our goal is to compute a complete set of protein orthologs derived from all currently available complete bacterial and archaeal genomes. Traditional approaches typically rely on all-against-all BLAST searching which is prohibitively expensive in terms of hardware requirements or computational time (requiring an estimated 18 months or more on a typical server). Here, we present xBASE-Orth, a system for ongoing ortholog annotation, which applies a “divide and conquer” approach and adopts a pragmatic scheme that trades accuracy for speed. Starting at species level, xBASE-Orth carefully constructs and uses pan-genomes as proxies for the full collections of coding sequences at each level as it progressively climbs the taxonomic tree using the previously computed data. This leads to a significant decrease in the number of alignments that need to be performed, which translates into faster computation, making ortholog computation possible on a global scale. Using xBASE-Orth, we analyzed an NCBI collection of 1,288 bacterial and 94 archaeal complete genomes with more than 4 million coding sequences in 5 weeks and predicted more than 700 million ortholog pairs, clustered in 175,531 orthologous groups. We have also identified sets of highly conserved bacterial and archaeal orthologs and in so doing have highlighted anomalies in genome annotation and in the proposed composition of the minimal bacterial genome. In summary, our approach allows for scalable and efficient computation of the bacterial and archaeal ortholog annotations. In addition, due to its hierarchical nature, it is suitable for incorporating novel complete genomes and alternative genome annotations. The computed ortholog data and a continuously evolving set of applications based on it are integrated in the xBASE database, available at http://www.xbase.ac.uk/

    Bioinformatics analysis of the locus for enterocyte effacement provides novel insights into type-III secretion

    Get PDF
    BACKGROUND: Like many other pathogens, enterohaemorrhagic and enteropathogenic strains of Escherichia coli employ a type-III secretion system to translocate bacterial effector proteins into host cells, where they then disrupt a range of cellular functions. This system is encoded by the locus for enterocyte effacement. Many of the genes within this locus have been assigned names and functions through homology with the better characterised Ysc-Yop system from Yersinia spp. However, the functions and homologies of many LEE genes remain obscure. RESULTS: We have performed a fresh bioinformatics analysis of the LEE. Using PSI-BLAST we have been able to identify several novel homologies between LEE-encoded and Ysc-Yop-associated proteins: Orf2/YscE, Orf5/YscL, rORF8/EscI, SepQ/YscQ, SepL/YopN-TyeA, CesD2/LcrR. In addition, we highlight homology between EspA and flagellin, and report many new homologues of the chaperone CesT. CONCLUSION: We conclude that the vast majority of LEE-encoded proteins do indeed possess homologues and that homology data can be used in combination with experimental data to make fresh functional predictions

    Gene doctoring: a method for recombineering in laboratory and pathogenic Escherichia coli strains

    Get PDF
    Background: Homologous recombination mediated by the lambda-Red genes is a common method for making chromosomal modifications in Escherichia coli. Several protocols have been developed that differ in the mechanisms by which DNA, carrying regions homologous to the chromosome, are delivered into the cell. A common technique is to electroporate linear DNA fragments into cells. Alternatively, DNA fragments are generated in vivo by digestion of a donor plasmid with a nuclease that does not cleave the host genome. In both cases the lambda-Red gene products recombine homologous regions carried on the linear DNA fragments with the chromosome. We have successfully used both techniques to generate chromosomal mutations in E. coli K-12 strains. However, we have had limited success with these lambda-Red based recombination techniques in pathogenic E. coli strains, which has led us to develop an enhanced protocol for recombineering in such strains. \ud \ud Results: Our goal was to develop a high-throughput recombineering system, primarily for the coupling of genes to epitope tags, which could also be used for deletion of genes in both pathogenic and K-12 E. coli strains. To that end we have designed a series of donor plasmids for use with the lambda-Red recombination system, which when cleaved in vivo by the I-SceI meganuclease generate a discrete linear DNA fragment, allowing for C-terminal tagging of chromosomal genes with a 6xHis, 3xFLAG, 4xProteinA or GFP tag or for the deletion of chromosomal regions. We have enhanced existing protocols and technologies by inclusion of a cassette conferring kanamycin resistance and, crucially, by including the sacB gene on the donor plasmid, so that all but true recombinants are counter-selected on kanamycin and sucrose containing media, thus eliminating the need for extensive screening. This method has the added advantage of limiting the exposure of cells to the potential damaging effects of the lambda-Red system, which can lead to unwanted secondary alterations to the chromosome. \ud \ud Conclusion: We have developed a counter-selective recombineering technique for epitope tagging or for deleting genes in E. coli. We have demonstrated the versatility of the technique by modifying the chromosome of the enterohaemorrhagic O157:H7 (EHEC), uropathogenic CFT073 (UPEC), enteroaggregative O42 (EAEC) and enterotoxigenic H10407 (ETEC) E. coli strains as well as in K-12 laboratory strains

    Recovery of a medieval Brucella melitensis genome using shotgun metagenomics

    Get PDF
    Shotgun metagenomics provides a powerful assumption-free approach to the recovery of pathogen genomes from contemporary and historical material. We sequenced the metagenome of a calcified nodule from the skeleton of a 14th-century middle-aged male excavated from the medieval Sardinian settlement of Geridu. We obtained 6.5-fold coverage of a Brucella melitensis genome. Sequence reads from this genome showed signatures typical of ancient or aged DNA. Despite the relatively low coverage, we were able to use information from single-nucleotide polymorphisms to place the medieval pathogen genome within a clade of B. melitensis strains that included the well-studied Ether strain and two other recent Italian isolates. We confirmed this placement using information from deletions and IS711 insertions. We conclude that metagenomics stands ready to document past and present infections, shedding light on the emergence, evolution, and spread of microbial pathogens

    Creation of Golden Gate constructs for gene doctoring

    Get PDF
    Background: Gene doctoring is an efficient recombination-based genetic engineering approach to mutagenesis of the bacterial chromosome that combines the λ-Red recombination system with a suicide donor plasmid that is cleaved in vivo to generate linear DNA fragments suitable for recombination. The use of a suicide donor plasmid makes Gene Doctoring more efficient than other recombineering technologies. However, generation of donor plasmids typically requires multiple cloning and screening steps. Results: We constructed a simplified acceptor plasmid, called pDOC-GG, for the assembly of multiple DNA fragments precisely and simultaneously to form a donor plasmid using Golden Gate assembly. Successful constructs can easily be identified through blue-white screening. We demonstrated proof of principle by inserting a gene for green fluorescent protein into the chromosome of Escherichia coli. We also provided related genetic parts to assist in the construction of mutagenesis cassettes with a tetracycline-selectable marker. Conclusions: Our plasmid greatly simplifies the construction of Gene Doctoring donor plasmids and allows for the assembly of complex, multi-part insertion or deletion cassettes with a free choice of target sites and selection markers. The tools we developed are applicable to gene editing for a wide variety of purposes in Enterobacteriaceae and potentially in other diverse bacterial families

    The microbial ecology of Escherichia coli in the vertebrate gut.

    Get PDF
    Escherichia coli has a rich history as biology's 'rock star', driving advances across many fields. In the wild, E. coli resides innocuously in the gut of humans and animals but is also a versatile pathogen commonly associated with intestinal and extraintestinal infections and antimicrobial resistance-including large foodborne outbreaks such as the one that swept across Europe in 2011, killing 54 individuals and causing approximately 4000 infections and 900 cases of haemolytic uraemic syndrome. Given that most E. coli are harmless gut colonizers, an important ecological question plaguing microbiologists is what makes E. coli an occasionally devastating pathogen? To address this question requires an enhanced understanding of the ecology of the organism as a commensal. Here, we review how our knowledge of the ecology and within-host diversity of this organism in the vertebrate gut has progressed in the 137 years since E. coli was first described. We also review current approaches to the study of within-host bacterial diversity. In closing, we discuss some of the outstanding questions yet to be addressed and prospects for future research
    corecore